Part II - (Prosper Loan Data)¶

by (Tolulope Ogunfuwa)¶

Investigation Overview¶

This investigation was done to review the different features that are related to loan acquisition and disbursement. The goal was to observe the different relationships between these features and at the end deduce the elements that are most important in the arrangement of loans.

Dataset Overview¶

The working dataset of contains 83982 loans with 21 features was sieved out from 113937 loans and 81 features. Examples of the features include: Borrower Annual Percentage Rate, Borrower Rate, Employment Status, Occupation, Prosper Rating, Original Loan Amount, etc.

Distribution of loans listed by Year and month¶

January, October, December, November, & February are the five most active months for loan collection. These months are either toward the end of a year or the beginning of a new year. It can be suggested that people like to borrow loans in those periods to prepare for the festive season as well as prepare for what the new year holds in terms of personal projects, school fees, & other expenses.

The distribution for the Year indicates that most of the loans were collected in 2013. There's not enough information to give a suggestion to why 2013 experienced an overwhelming number of loan collection.

RELATIONSHIP BETWEEN BORROWER'S ANNUAL PERCENTAGE RATE (APR) & BORROWER RATE¶

The distribution for Borrower APR indicates that it is multimodal.

Across the distribution, we see about four different peak periods. One is between 0.08 & 0.09. The 2nd is between 0.2 & 0.22. The third is between 0.28 & 0.29. And the last one, with the most counts, is between 0.34 & 0.36. The distribution mainly spans between 0.1 & 0.4

The distribution for the Borrower's Rate is almost identical to that of Borrower's Annual Percentage Rate (APR). They share the similar pattern of having multiple modes within the same range.

THE DISTRIBUTION OF CATEGORICAL FEATURES (EMPLOYMENT STATUS & PROSPER RATING¶

80% of borrowers are employes which suggests that being employed is significant to getting a loan.

Most of the people fall within ratings E to A. The remaining percentages are either people with the lowest credit risk (AA) or the highest credit risk (HR)